CDS
Accession Number | TCMCG075C05892 |
gbkey | CDS |
Protein Id | XP_017970903.1 |
Location | complement(join(6535671..6535811,6536609..6536720,6537036..6537118,6537258..6537554,6537969..6538244,6538331..6539071,6539730..6540335)) |
Gene | LOC18608196 |
GeneID | 18608196 |
Organism | Theobroma cacao |
Protein
Length | 751aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018115414.1 |
Definition | PREDICTED: KH domain-containing protein HEN4 isoform X1 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGGCAGCACCTTCCTCTCTATACCAACAAAGCGAGCCATGCCCGACGCGACCCCTTCTTCGAACGGCCCCTCCAAGCGTTCCAAGCCTCCGGCTACTCCTCTCCCTGTCCCCCCTGGTCACGTTGCCTTCCGTCTCCTCTGCCACGTGTCCCGTGTCGGTGGCGTCATCGGCAAGTCAGGCAGTGTCATCAAGCAACTGCAACAGGCTACCGGTTCCAAGATCCGAATTGAGGATGCCCCGGCTGAAAGCCCGGACCGGGTTATTACCGTCATAGGCCCGAATGCTGTTAATACCAAGATTGTACTGAATTATGGTAGCCTTGGCAATGGTTACGGTAGTAGCGTTGAGGAAATCGATGTGTCCAAGGCGCAGGAGGCGTTAGTGAGAGTGTTCGAGAGGATTCTGGAGGTGGCGGCGGAGAGCGATGGAGTGGCCTTGGTGATGGTTTCTTGTCGGTTATTGGCGGAGGTTAAGCACGTTGGGAGCGTGATAGGGAAAGGAGGTAAGGTAGTGGAGAAGATAAGGGAAGATACTGGGACCAAAATTAGGGTTTTGACGGATAAGCTACCGGCTTGTGCCAGCCCCACGGAGGAGATTGTGGAGATTGAAGGAGGTGTTTTAGCTGTAAAGAAAGCGCTTGTTGCTGTCTCACATCGCCTCCAAGATTGCCCTCCTGTCAATAAAACAAGGATAACTGAAAACAGGATCATTGAATCAGTTCCTTCAGAGGCTTGGCATAAACCTATTGAGTTACTTCCTCAGGAGACTTTGCGAAGGCCTATTGACTTATTTCCCCAGGACACTTTGTACAGGCCTATTGACTTACTTCCTCAGGAGACTTTGCGCAGAGCTATTGAGGTACTTCCCCAGGAGACTTTGCACAGACCTATTGAGGTTGTTCCACAGGAGCCATTGCACAGACCTATTGATGTTGTTCCACAGGGCTCCTTGCGTAGACATATTGATGTTGTTCCACAGGGCTCCTTGCGTAGACCTATTGATGTTGTTTCTCAGGAGGCTTTACCTGATCTGAATATAGATCATCTTTCACAGCGTAGTTCCCTGATGCCTACTATATCCAGTAGCTCCATCAGTTATGCCACCAGAGTTCATCCTTTGTCTCTAGAGTCCGAGAATGCTTCTCCATTGGATACAAAAACATTGCAGCATGAAGTGGTTTTTAAAATTCTTTGCTCCAGTGATAGGGTTGGGGGTGTTATTGGAAAGGGAGGTGCAATCATTAAGGCTCTTCAAAGTGATACAGGAACTACTATTACCATTGGACCTACACTCACTGATTGTGATGAACGGTTGGTAACTGTTACTGCATCAGAGGTCAGTCAGAACCCAGAATCACAGTATTCTCCAGCACAAAAGGCTGTTGTGCTTGTTTTTGTAAGAGCTTTGGAGGCGTCAATTGAAAAAGGGCTAGATTCAGGCTCAGGTAAGGGTTCAAATGTCACAGCTCGGCTTGTAGTTCCATCAGGCCAAGTTGGCTGTCTGTTGGGAAAAGGAGGTGCAATAATTTCTGAAATGCGTAAAGTGACTGGTACCGGCATTCGAATTTTGGGATCTGACCAGGTCCCTAAGTGTGTCACTGAAAATGACCAAGTGGTGCAGATTTCAGGAGGGTATTTGAATGTGAAAGATGCTATATATCATGTTACTGGTAGACTACGAGATAACCTATTTTCTAGCACACTGAAGAATGCTGGAGCAAAAAGTAGTTCTGCTGTTTTAACTGAGACCAGTCCTTATGAAAGATTGATGGACACTGCCCCTCTTGGGCTGCAAGTATCAAGTGGTGTTTCTTATAATCTTAGTCGGCATACGACATTGGCACCGAATAGTACGGATTCCTTTGGACTTTCCCGTAGTTTAGATTGCCCTCATTCACCAGGGTTATGGACATCAGAGACAGGTAATGTACTGAATCCAAGGAGCACCACAGATATCGGCAGAGGATTGACTTCTCTTAGAGGTGGATTTGAACTTGGCAGTGGAAACAGATCTGCTATTGTGACAAATACAACTGTAGAGATTAGAGTTCCTGAGAATGTTATTGACTCTGTTTATGGGGAGAATGGTCGCAATCTGTCTCGGTTGAGAGAGATCTCTGGTGCCAAGGTCATAGTGCATGAACCTCAAATAGGAACAAGTGACAGGATTGTTGTCATATCTGGGACACCTGATCAAACCCAGGCGGCTCAGAGCCTCCTTCAAGCTTTCATCCTCACTGGTCCATCACGTTGA |
Protein: MGSTFLSIPTKRAMPDATPSSNGPSKRSKPPATPLPVPPGHVAFRLLCHVSRVGGVIGKSGSVIKQLQQATGSKIRIEDAPAESPDRVITVIGPNAVNTKIVLNYGSLGNGYGSSVEEIDVSKAQEALVRVFERILEVAAESDGVALVMVSCRLLAEVKHVGSVIGKGGKVVEKIREDTGTKIRVLTDKLPACASPTEEIVEIEGGVLAVKKALVAVSHRLQDCPPVNKTRITENRIIESVPSEAWHKPIELLPQETLRRPIDLFPQDTLYRPIDLLPQETLRRAIEVLPQETLHRPIEVVPQEPLHRPIDVVPQGSLRRHIDVVPQGSLRRPIDVVSQEALPDLNIDHLSQRSSLMPTISSSSISYATRVHPLSLESENASPLDTKTLQHEVVFKILCSSDRVGGVIGKGGAIIKALQSDTGTTITIGPTLTDCDERLVTVTASEVSQNPESQYSPAQKAVVLVFVRALEASIEKGLDSGSGKGSNVTARLVVPSGQVGCLLGKGGAIISEMRKVTGTGIRILGSDQVPKCVTENDQVVQISGGYLNVKDAIYHVTGRLRDNLFSSTLKNAGAKSSSAVLTETSPYERLMDTAPLGLQVSSGVSYNLSRHTTLAPNSTDSFGLSRSLDCPHSPGLWTSETGNVLNPRSTTDIGRGLTSLRGGFELGSGNRSAIVTNTTVEIRVPENVIDSVYGENGRNLSRLREISGAKVIVHEPQIGTSDRIVVISGTPDQTQAAQSLLQAFILTGPSR |